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be used to determine the properties of dark energy and the nature 
of gravity, because light from those galaxies is bent by gravity from 
the intervening dark matter. The observed galaxy images appear dis- 
torted, although only slightly, and their shapes must be precisely 
disentangled from the effects of pixelisation, convolution and noise. 
The worldwide gravitational lensing community has made significant 
progress in techniques to measure these distortions via the Shear 
TEsting Program (STEP). Via STEP, we have run challenges within 
our own community, and come to recognise that this particular im- 
age analysis problem is ideally matched to experts in statistical in- 
ference, inverse problems and computational learning. Thus, in or- 
der to continue the progress seen in recent years, we are seeking an 
infusion of new ideas from these communities. This document de- 
tails the GREAT08 Challenge for potential participants. Please visit 
www.great08challenge.info for the latest information. 

1. Introduction. Our Universe appears to be dominated by dark mat- 
ter and dark energy [Biello and Caldwell (2006), Linder and Perlmutter 
(2007)]. These are not well described or even understood by modern science, 
so studying their properties could provide the next major breakthrough in 
physics. This may ultimately lead to a discovery of a new class of fundamen- 
tal particle or a theory of gravity that supersedes Einstein's theory of general 
relativity. For this reason, the primary science drivers of most cosmological 
surveys are the study of dark matter and dark energy. Funding agencies 
worldwide have committed substantial resources to tackling this problem; 
several of the planned projects will spend tens to hundreds of millions of 
taxpayers' Euros on this topic. 

Many cosmologists have concluded that gravitational lensing holds the 
most promise to understand the nature of dark matter and dark energy 
[Albrecht et al. (2006), Peacock et al. (2006)]. Gravitational lensing is the 
process in which light from distant galaxies is bent by the gravity of inter- 
vening mass in the Universe as it travels towards us. This bending causes the 
shapes of galaxies to appear distorted [Bartelmann and Schneider (2001), 
Wittman (2002), Refregier (2003b) and Munshi et al. (2006)]. We can relate 
measurements of the statistical properties of this distortion to those of the 
dark matter distribution at different times in the history of the Universe. 
From the evolution of the dark matter distribution we can infer the main 
properties of dark energy. 

To extract significant results for cosmology, it is necessary to measure 
the distortion to extremely high accuracy for millions of galaxies, in the 
presence of observational problems such as blurring, pixelisation and noise 
and theoretical uncertainty about the undistorted shapes of galaxies. Our 
techniques are good enough to analyse current data but we need a factor of 
ten improvement to capitalise on future surveys, which requires an injection 
of new ideas and expertise. We challenge you to solve this problem. 
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Section 2 explains the general problem and presents an overview of our 
current methods. Section 3 describes in detail the GREAT08 Challenge sim- 
ulations, rules and assessment. We conclude in Section 4 with a summary of 
the additional issues that arise in more realistic image analysis, that could 
be the basis of future GREAT Challenges. 

2. The problem. For the vast majority of galaxies the effect of gravita- 
tional lensing is to simply apply a matrix distortion to the whole galaxy 
image 



where a positive "shear" g\ stretches an image along the x axis and com- 
presses along the y axis; a positive shear g2 stretches an image along the 
diagonal y = x and compresses along y = —x. The coordinate (x u y u ) de- 
notes a point on the original galaxy image (in the absence of lensing) and 
(x\ yi) denotes the new position of this point on the distorted (lensed) im- 
age. There is also an isotropic scaling that we ignore here. This seems a 
sensible parameterisation to use for the shear because it is linear in the 
mass [e.g., Kaiser, Squires and Broadhurst (1995)]. The top left two panels 
of Figure 2 illustrate an exceptionally high quality galaxy image before and 
after application of a large shear. For cosmic gravitational lensing a typical 
shear distortion is gi ~ 0.03, therefore a circular galaxy would appear to be 
an ellipse with major to minor axis ratio of 1.06 after shearing. Note that 
the three-dimensional shape of the galaxy is not important here; we are 
concerned only with the two-dimensional (projected) shape. 

Since most galaxies are not circular, we cannot tell whether any individual 
galaxy image has been sheared by gravitational lensing. We must statistically 
combine the measured shapes of many galaxies, taking into account the 
(poorly known) intrinsic galaxy shape distribution, to extract information 
on dark matter and dark energy. Shear correlations were first measured 
in 2000 [Bacon, Refreiger and Ellis (2000), Kaiser, Wilson and Luppino 
(2000), Wittman et al. (2000) and van Waerbeke et al. (2000)] and the most 
recent results [Massey et al. (2007c), Fu et al. (2008)] use millions of galaxies 
to measure the dumpiness of dark matter to around 5 percent accuracy. 
Figure 1 shows a three-dimensional map of the dark matter reconstructed by 
Massey et al. (2007b). Future surveys plan to use roughly a billion galaxies 
to measure the dark matter dumpiness to extremely high accuracy and 
thus measure the properties of dark energy to 1 percent accuracy. This will 
require a measurement accuracy on each of g\ and 52 of better than 0.0003. 
However this can only be achieved if statistical inference problems can be 
overcome. 



(2.1) 
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Fig. 1. Illustration of the invisible dark matter distribution inferred using gravitational 
lensmg detected in the Cosmic Evolution Survey (COSMOS) [Massey et al. (2007b)J. The 
three axes of the box correspond to sky position (in right ascension and declination), and 
distance from the Earth increasing from left to right (as measured by cosmological redshift). 
Image credit: NASA, ESA and R. Massey (California Institute of Technology). 

Shear measurement is an inverse problem, illustrated in Figures 2 and 3. 
The forward process is illustrated in Figure 2: (i) each galaxy image begins as 
a compact shape, which appears sheared by the operation in equation (2.1); 
(ii) the light passes through the atmosphere (unless the telescope is in space) 
and telescope optics, causing the image to be convolved with a kernel; (iii) 
emission from the sky and detector noise cause a roughly constant "back- 
ground" level to be added to the whole image; (iv) the detectors sum the 
light falling in each square detector element (pixel); and (v) the image is 
noisy due to a combination of Poisson noise 23 in the number of photons 
arriving in each pixel, plus Gaussian noise due to detector effects. The ma- 
jority of galaxies we need to use for cosmological measurements are faint: a 
typical uncertainty in the total amount of galaxy light is 5 percent. 



Poisson noise arises because there is a finite number of photons arriving at the detector 
during the fixed length of time that the shutter is open. The probability of receiving n 
photons in a pixel is therefore given by Pr(n|A) = A"e _A /n! where A is the mean number 
of photons observed in that pixel during many exposures of the same length of time. 
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The Forward Process. 

Galaxies; Intrinsic galaxy shapes to measured image; 




Intrinsic galaxy Gravitational lensing Atmosphere and telescope Detectors measure Image also 

(shape unknown) causes a sheer (g) cause a convolution a ptKelated image contains noise 



Stars: Point sources to star images: 




Intrinsic star Atmosphere and telescope Detectors measure Image aho 

(point source) cause a convolution a pixelated image contains noise 



Fig. 2. Illustration of the forward problem. The upper panels show how the original galaxy 
image is sheared, blurred, pixelised and made noisy. The lower panels show the equivalent 
process for (point-like) stars. We only have access to the right hand images. 

Stars are far enough away from us to appear point-like. They therefore 
provide noisy and pixelised images of the convolution kernel (lower panels of 
Figure 2). The convolution kernel is typically of a similar size to the galaxies 




Fig. 3. Illustration of the inverse problem. We begin on the right with a set of galaxy and 
star images. The full inverse problem would be to derive both the shears and the intrinsic 
galaxy shapes. However shear is the quantity of interest for cosmologists. 
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we are observing. If it were not accounted for, we would therefore underesti- 
mate the shear. The kernel can also be up to ten times more elliptical than 
the ellipticity induced by gravitational shear. If this is improperly accounted 
for, it can masquerade as the cosmological effect we are trying to measure. 
In real astronomical observations, the kernel varies across a single image 
containing hundreds of stars and galaxies, and also from one image to the 
next. Since stars are distributed all over the sky we can use nearby stars on 
a given image to estimate the kernel for a given galaxy. 

The most significant obstacle to shear measurement is that the intrinsic 
shape of each galaxy is unknown. Even the probability distribution function 
of possible shapes from which it could have been drawn is highly uncertain; 
we do not even have a good parameterisation for galaxy shapes. We try to 
categorise galaxies into three types: spirals (e.g., Figure 2), ellipticals and 
irregulars but many galaxies are somewhere between the categories. 

One good assumption that we can make is that unlensed galaxies are 
randomly oriented. In addition we find that the radially averaged ID galaxy 
light intensity profile I(r) is well fit by I(r) = I Q exp(— {r/r c ) l / n ) [Sersic 
(1968)], where I Q , r c and n are free parameters and r is the distance from the 
centre of the galaxy. For elliptical galaxies n ~ 4 ( "de Vaucouleurs profile" ) 
and for spirals n ~ 1 ("exponential profile"). Unfortunately we do not have 
suitable galaxy images which are free of pixelisation and convolution from 
which to learn about intrinsic galaxy shapes. We can however make low 
noise observations of some small areas of sky. 

Methods developed so far by the lensing community are discussed in detail 
in the appendices and references therein. At the Challenge launch we will 
provide code implementing some existing methods. Their performance on 
earlier blind challenges is discussed in Heymans et al. (2006) and Massey et 
al. (2007a). In all existing methods each star is analysed to produce some 
information about the convolution kernel. This is averaged or interpolated 
over a number of stars to reduce the noise and produce information about 
the kernel at the position of each galaxy. The galaxy image is analysed, 
taking into account the kernel, to produce an estimate of the shear (g± and 
52) at the position of that galaxy. 

Real astronomical data is simply an image of the continuous night sky. 
The first step of any analysis pipeline is therefore to identify stars and galax- 
ies (distinguishing small, faint galaxies from small, faint stars in a noisy 
image is a nontrivial task), cut out images around them and estimate the 
local background level. Since the convolution kernel also usually varies as 
a function of time and image position, the apparent shapes of stars must 
be modelled, and the model coefficients interpolated to the positions of the 
galaxies. Simplifications have been made in the GREAT08 data to eliminate 
these steps. 
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In real data the shear fields g\ and 52 vary across the sky due to the 
dumpiness of dark matter in the Universe. They also vary with the distance 
of the galaxy. It is usually reasonable to assume that the shear is constant 
across the image of a single galaxy. In practice the shear is different for each 
galaxy but is zero when averaged over a large survey, that is, (g\) = (#2) = 0. 
It is necessary to use images of both stars and galaxies to extract the shear 
field in the presence of the unknown convolution kernel. In this process our 
priority is not to learn about the properties of the unlensed galaxy images. 

Conventionally, the shear information from each galaxy image is combined 
to produce a statistic that can be predicted from a cosmological model. 
For example, the most common statistic is the shear correlation function 
(dii9ij) + (92i92j) [e.g., Bartelmann and Schneider (2001)] where the averages 
are carried out over all galaxy pairs i and j at a given angular separation on 
the sky. The properties of dark matter and dark energy can then be inferred 
by calculating the probability of the observed statistics as a function of 
cosmological parameters. The whole process is illustrated in Figure 4. Note 
that GREAT08 focuses entirely on the process of going from image to shear 
estimate because this is the current bottleneck that is hindering further 
analysis of astronomical data. However shear measurement methods will 
ultimately need to fit into this larger scheme to be useful for cosmology. 

3. The GREAT08 Challenge. In the previous section we described the 
general cosmic lensing problem. In this section we focus on the specifics 
of the GREAT08 Challenge. We start by describing the properties of the 
GREAT08 simulations. We explain how the results are assessed and the 
winner determined. 

3.1. Simulations. The Challenge images are made by simulation, using 
the flowchart of the forward problem (Figure 2). We have made a number 
of simplifications which we aim to relax for future GREAT Challenges, as 
discussed in Section 5. The simulations consist of many small (roughly 40 
by 40 pixel) images, each containing a single object. The images are clearly 
labelled as either stars (kernel image) or galaxies. The objects are located 
roughly, but not exactly, in the centre of each image. The images are divided 
into different "sets," each containing thousands of images. All the images 
within a set have identical values of the shear g\ and 52 and an identical 
convolution kernel. A very large constant is added to all pixels in a set and 
Poisson noise is added to each pixel. For GREAT08 RealNoise-Known and 
GREAT08 RealNoise-Blind (see below) the constant is so large that the 
noise is very close to being Gaussian with the same variance for every pixel 
in the image. You may use all these facts in your analysis. 

The star images in each set provide information on the convolution ker- 
nel. To simplify the Challenge we also provide the equations used to make 
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A full weak tensing pipeline: 

The broader context typical for cosmological measurements 




Apply statistic 
(e.g. correlation of 
shear in pairs of 
galaxies r apart) 




Dark matter 
Dark energy 
Cosmological physics 



Fig. 4. Flowchart indicating the extent of a full conventional cosmic gravitational lensing 
data analysis pipeline, from measuring the convolution kernel using the shapes of stars, to 
measurements of cosmology. The GREAT08 Challenge focuses exclusively on the steps en- 
closed in the box made by the dashed black line. The final winners will be determined based 
solely on estimates of shear. Simulation credit: Andrey Kravtsov (University of Chicago). 

these kernel images. Therefore you have the choice of whether to use the 
exact equations or the star images provided. In each star image the star 
has a different centre position and different random noise realisation. The 
noise level and number of star images should be sufficient to reconstruct 
the convolution kernel to a precision where uncertainties in the convolution 
kernel are smaller than the small uncertainty due to the finite number of 
galaxies. Your challenge is to derive an estimate of the shear applied to the 
galaxy images within each set. 
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This Challenge is difficult because of the following realistic features: (i) 
the extremely high accuracy required on the final answer; (ii) a model for the 
galaxies is not provided, and the galaxy shape and position are different from 
image to image (drawn from some underlying model which is not disclosed) ; 
(hi) there is convolution and noise; (iv) images are pixelised. 

The GREAT08 galaxy image simulation types are summarized in Ta- 
ble 1. To make the Challenge more approachable there are a few sets of 
low noise simulations ( "GREAT08 LowNoise-Known" and "GREAT08 Low- 
Noise-Blind"). The true shear values are provided for a subset of these 
("GREAT08 LowNoise-Known" ) and there is a blind competition for the 
remainder ( "GREAT08 LowNoise-Blind" ) . The main challenge ( "GREAT08 
RealNoise-Blind" ) has a realistic, much higher, noise level. There are also 
some sets with a realistic noise for which the true shears are provided 
("GREAT08 RealNoise-Known" ) . It is not possible to determine the true 
shear of a galaxy, even with an infinite amount of data. Therefore a method 
that requires a perfect training set will not be useful in practice. However we 
will be able to make simulations of the sky using imperfect galaxy models. 
To simulate this future situation, we use a slightly different model for the 
galaxies in the "Known" sets than in the "Blind" sets. This means that al- 
though methods that require a training set can be used (see rule 4), they may 
be at a small relatively realistic disadvantage, depending on the sensitivity 
of the method to the galaxy properties. 

3.2. Results. Each submission consists of a shear estimate {g\ and §2) for 
each set of images, with associated 68 percent error bars. A quality factor is 
calculated for each submission using the differences between the submitted 
and true shear values. 

The goal of the Challenge is to successfully recover the true input shear 
values used in the simulation, <?* -, g\j, for each set of images, j. You may use 
whatever method you like to combine the shear information from each galaxy 
within a set to estimate the shear applied to the whole set. The submitted 
shear values, g^, g^j, will differ from the true values due to the noise on the 
images and due to any biases induced by the measurement method. A good 
method would both filter the noise effectively and have small or nonexistent 



Table 1 

Summary of the three GREAT08 simulation suites 





True shears provided 


Blind competition 


Low noise 
Realistic noise 


GREAT08 LowNoise-Known 
GREAT08 RealNoise-Known 


GREAT08 LowNoise-Blind 
GREAT08 RealNoise-Blind 
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biases. We define the quality factor in terms of the mean squared error 



where the inner angle brackets denote an average over sets with similar shear 
value and observing conditions j Ek. The outer angle brackets denote an 
average over simulations with different true shears and observing conditions 
k and shear components i [see Kitching et al. (2008)]. 

This deliberately designed to reward methods that have small biases. This 
is important because in cosmology we average over a very large number of 
galaxies and any remaining bias will bias cosmological parameters. This 
definition will also penalise methods that have small biases at the expense 
of being extremely noisy. 

This quantity does not include the error bars you submit. We are not in- 
terested in a method which has large but accurate error bars since it will not 
produce tight cosmological constraints. Furthermore the Challenge images 
cover only a small (but realistic) range of observing conditions, therefore 
it is unlikely that an ultimately useful method would lose the competition 
because of poor performance in a particular corner of observing condition 
parameter space where your method has particularly large error bars. 

It has been shown that a systematic variance (({g^ — 9ij)jek) 2 )ik < 10 -7 
will be needed to fully utilise future cosmic lensing data sets [Amara and 
Refregier (2007)], corresponding to Q = 1000 [see also Huterer et al. (2006), 
Van Waerbeke et al. (2006)]. The number of galaxy images included in 
GREAT08 LowNoise-Blind and in GREAT08 RealNoise-Blind are sufficient 
to test Q to this value. If a single constant value of zero shear were sub- 



mitted (gij = g%j = for all j) then since ^(gij)ij ~ 0.03 it follows that 
Q ~ 0.1. The existing methods that have been used to analyse astronomical 
data have Q ~ 10, which was sufficient for those surveys. 

The GREAT08 Challenge Winner is the entry with the highest Q value 
on GREAT08 RealNoise-Blind. These will be publicly available on a leader 
board, as mocked-up in Table 2. Results using several existing methods 
appear on the leader board at the start of the Challenge, to show the current 
state-of-the art. 

The main diagnostic indicator in the leader board is the quality factor 
Q, which determines the ranking of the submissions. As discussed above, 
the quality factor does not take into account the submitted uncertainty 
estimates on the shears, whereas an ideal method would calculate these 
reliably. We make an internal estimate of the uncertainties and compare 
with your submission to produce an error flag. If the uncertainty estimates 
are on average wrong to more than a factor of two then this is flagged in 
the leader board. There are no consequences of the error flag in determining 




Q 
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Table 2 

A mock leader board, showing a potential range of results. Submissions by members of the 
GREAT08 Team are marked with an asterisk. There are two leader boards: one for 
GREAT 08 LowNoise- Blind and one for GREAT08 RealNoise-Blind 









Error 


Number of 


Date of last 


Name 


Method 


Q 


flag 


submissions 


submission 


A. Einstein 


BcstLcts 


1001 




15 


25 Dec 2008 


Team Bloggs 


Joel 


582 


Warning 


2 


2 Nov 2008 


Dr. Socrates 


ArcheoShapes 


116 


Warning 


212 


23 Sept 2008 


W. Lenser* 


KSB+ + + 


99 




12 


10 Aug 2008 


A. Monkey 


Guess Again 


1.2 


Warning 


5 


30 Nov 2008 



the winner. The winner may have an error flag warning and will still win, 
based on their Q value. 

The data for which true shears are provided (GREAT08 LowNoise-Known 
and GREAT08 RealNoise-Known) are released publically in July 2008. The 
challenge data (GREAT08 LowNoise-Blind and GREAT08 RealNoise-Blind) 
are released in fall 2008 and the deadline will be 6 months after the release 
of the challenge data. Please see www.great08challenge.info for the latest 
information and discussions in the GREAT08 section of CosmoCoffee at 
www.cosmocoffee.info. The Challenge deadline is to be followed by a more 
detailed report making use of the internal structure of the simulations to 
identify which observational conditions favour which method. We hope this 
will lead to a publication and workshop. 

4. Conclusions. The field of cosmic gravitational lensing has recently 
seen great successes in measuring the distribution of dark matter. Indeed, 
hundreds of millions of Euros will soon be spent on exciting new surveys to 
determine the nature of the two fundamental (yet quite mysterious) materi- 
als that are the most common in our Universe. Uniquely among cosmological 
tests, measurements of cosmic lensing are not currently limited by compli- 
cated astrophysical processes occurring half-way across the Universe, but 
by improved techniques for statistical image analysis right here on Earth. 
Cosmologists have hosted several shear measurement competitions amongst 
themselves, and developed several methods that achieve an accuracy of a few 
percent. However, reaching the accuracy required by future surveys needs a 
fresh approach to the problem. The GREAT08 Challenge is designed to seek 
out your expertise. Aside from the academic interest in solving a challenging 
statistical problem, successful methods are absolutely essential for further 
advances in cosmological investigations of dark matter and dark energy. 

GREAT08 marks the first time that the challenge of high precision galaxy 
shape measurements has been set outside the gravitational lensing commu- 
nity, and as such marks a first step in a global effort to develop the next 



12 



S. BRIDLE ET AL. 



generation of cosmological tools using expertise, experience and techniques 
coming from a broad disciplinary base. The field of gravitational lensing is 
expected to grow at an increasing rate over the coming decade but an injec- 
tion of new ideas is vital if we are to take full advantage of the potential of 
lensing to be the most powerful cosmological probe. The GREAT Challenges 
can therefore be seen as a comprehensive series where the goal of each step 
is both to bring new insight and to tackle more complicated problems than 
the previous step. 

5. GREAT08 simplifications and future challenges. The GREAT08 Chal- 
lenge outlined in this document is a difficult challenge despite the simplifying 
assumptions which include: 

• Constant Shear: Within each set of images the shear is constant whereas 
in real data shear is a spatially varying quantity from which correlation 
statistics are used to measure properties of the Universe. 

• Constant Kernel: Within each set the convolution kernel is constant whereas 
in real data this is a spatially and time varying quantity that also needs 
to be measured and interpolated between galaxy positions. 

• Simple Kernel: The convolution kernels used in this Challenge are simple 
relative to those of real telescopes. 

• Simple Galaxy Shape: The galaxies used in this Challenge are simple 
relative to real data. 

• Simple Noise Model: The noise is Poisson. In practice there would be un- 
usable bad pixels which may be flagged and the noise would be a combina- 
tion of Gaussian and Poisson, with the Gaussian contribution potentially 
varying across the image. 

• Image Construction: In GREAT08 there is only one object in each small 
image and each is labelled according to whether it is a star or a galaxy. The 
selection of galaxies in a real image must not depend on the applied shear 
otherwise this introduces an additional bias: if very elliptical galaxies are 
preferentially downweighted then galaxies that happen to be aligned with 
the shear will tend to be lost which will bias the measured shear low. 
In addition, in real data some galaxies overlap and are best discarded 
from conventional analyses. Furthermore, conventional analyses rely on 
accurate labelling of stars and galaxies. 

In GREAT09 we anticipate that many of these assumptions would be 
relaxed therefore methods which perform well in GREAT08 by overly capi- 
talising on the simplifications may not perform well in GREAT09. 

Beyond GREAT09 there are a multitude of further issues that have a 
significant effect on accurate shape measurement. Cosmic rays and satellite 
tracks contaminate the image [see Storkey et al. (2004)]; detector pixels 
vary in responsivity and the responsivity is not linear with the number of 
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photons (Charge Transfer Efficiency); the detector elements are not perfectly 
square and/or are not perfectly aligned in the telescope so that the sky 
coordinates do not perfectly map to pixel coordinates, and they bleed (Inter 
Pixel Responsivity); there are multiple exposures of each patch of sky, each 
with a different kernel. 

The ultimate test and verification of a method will be in its application to 
data. The goal of the GREAT Challenges is to encourage the development of 
methods which will one day be used in conjunction with state-of-the-art data 
in order to answer some of our most profound and fundamental questions 
about the Universe. 

APPENDIX A: RULES 

1. Participants may use a pseudonym or team name on the results leader 
board, however real names (as used in publications) must be provided 
where requested during the result submission process. 

2. Participants who have investigated several algorithms may enter once per 
method. Changes in algorithm parameters do not constitute a different 
method. 

3. Re-submissions for a given method may be sent a maximum of once per 
week during the 6 month competition. 

4. Since realistic future observations would include some low noise imag- 
ing, participants are welcome to use the GREAT08 Low-noise images to 
inform their GREAT08 Main analysis. We will never have observations 
for which the true shear is known, but we will be able to make our own 
attempts to simulate the sky, which could be used to train shear esti- 
mation methods. Therefore GREAT08 LowNoise-Known and GREAT08 
RealNoise-Known have slightly different galaxy properties than GREAT08 
LowNoise-Blind and GREAT08 RealNoise-Blind. GREAT08 LowNoise- 
Known and GREAT08 RealNoise-Known may be used to train the results 
of GREAT08 LowNoise-Blind and GREAT08 RealNoise-Blind. 

5. Participants must provide a report detailing the method used, at the 
Challenge deadline. We would prefer that the code is made public. 

6. We expect all participants to allow their results to be included in the final 
Challenge Report. We will however be flexible in cases where methods 
performed badly compared to the current methods if participants are 
strongly against publicising them. 

We will release the true shears after the deadline and you are encouraged to 
write research articles using the Challenge simulations. 

Some additional competition rules apply to members of the GREAT08 
Team who submit entries: 
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7. For the purpose of these rules, "GREAT08 Team" includes anyone who 
receives STEP and/or GREAT08 Team emails, and/or has the STEP 
password. The authors of this document all receive GREAT08 Team 
emails. 

8. Only information available to non-GREAT08 participants may be used 
in carrying out the analysis, for example, no inside information about the 
setup of the simulations may be used. 

Note that the true blind shear values will only be available to only a small 
subset of the GREAT08 Team. 

APPENDIX B: OVERVIEW OF EXISTING METHODS 

A variety of shear measurement methods have been developed by the cos- 
mic lensing community. Their goal is always to obtain an unbiased estimate 
g of the shear, such that the mean over a large population of galaxies is 
equal to the true shear (g) = g. However, they adopt different approaches to 
correct the nuisance factors in Figure 4 (convolution, pixelisation and noise). 

Most of the methods have been described, and tested on simulated im- 
ages, during the Shear TEsting Programme (STEP) [Heymans et al. (2006), 
Massey et al. (2007a)] and earlier [Bacon et al. (2001), Erben et al. (2001), 
Hoekstra et al. (2002)]. To summarise the current level of knowledge, but try- 
ing not to restrict the development of new ideas, we present here an overview 
of an idealised method. In Appendices C-G, we then provide a more detailed 
introduction to several methods that have been used on real astronomical 
data, with links to research papers. At the launch of the GREAT08 Chal- 
lenge, code for these methods will be made available and the corresponding 
results will be entered on the GREAT08 leader board. 

Potential participants may be interested in applying methods that require 
a set of training data which matches the Challenge data. We do not pro- 
vide such a set because this will not be available for realistic observations. It 
would in principle be possible to simulate data with similar properties to the 
observed data, but this will not match exactly because of our lack of knowl- 
edge of the detailed shapes of distant galaxies. We do not know whether 
or not this presents a fundamental limitation for this type of method. The 
(public) STEP1 and STEP2 simulations have a similar noise level to the 
GREAT08 images and the true shear is given. You are allowed to use these 
to train your methods if you wish. The galaxy properties are not the same 
as those in GREAT08 so this is a reasonable approximation to the realistic 
situation. However the objects are not isolated on postage stamps as for 
GREAT08. 
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B.l. Ellipticity measurement. We first describe a simple shear measure- 
ment method that would work in the absence of pixelisation, convolution 
and noise. The centre of the image brightness I(x,y) can be defined via its 
first moments 

/ I(x,y)x dx dy 



(B.l) 
(B.2) 



JI(x,y)dxdy ' 
I ' I{x,y)ydxdy 



I ' I{x,y)dxdy ' 
and we can then measure the quadrupole moments 

/ I(x, y) (x — x) 2 dx dy 



.(■.(■ 



(B.3) Q. 
(B.4) Q xy 
(B.5) Q yy 



JI(x,y)dx dy 

I Hx, y) ( x -x)(y- y) dx dy 

Jl(x,y)dx dy 
fl(v,y)(y - yfdx dy 



JI(x,y)dx dy 

Gravitational lensing maps the unlensed image, specified by coordinates 
(x u ,y n ), to the lensed image (x[,y\j using a matrix transformation 

(B.6) ( x A=a( Xi 

where 

(B.7) A=( 1 - 9 ' ~f 

Throughout GREAT08, the components of shear g\ and 52 are constant 
across the image of a galaxy; this is usually a good approximation in real 
images too. Under this coordinate transformation, it can be shown that 
quadrupole moment tensor Q transforms as 

(B.8) Q u = AQ l A T , 

where Q n is the quadrupole moment tensor before lensing and Q 1 is that 
after lensing. 

The overall ellipticity of a galaxy image can be quantified by the useful 
combination of moments [Bonnet and Mellier (1995)] 



(B.9) e = e 1 + ie 2 



Qxx Qyy + ^iQxy 



where we introduce the complex notation e = e\ + ie 2 and g = g\ + ig 2 where 
i 2 = —1. For a simple galaxy that has concentric, elliptical isophotes (con- 
tours of constant brightness) with major axis a and minor axis b, and angle 
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9 between the positive x axis and the major axis, 

(B.IO) ei = ^ cos(2 0), 

a + b 

(B.ll) e2 = ^ sin ( 2 0). 

a + b 

The quantity e transforms under shear as 
(B.12) 



l ^+g 



1 + g*e u 

for \g\ < 1, where the asterisk denotes complex conjugation [Seitz and Schnei- 
der (1997)]. This can be Taylor expanded to first order in g, for each of the 
two components i £ 1,2. 

To obtain measurements of g, we next assume that there is no preferred 
orientation for the shapes of galaxies in the absence of lensing. In this case, 
when averaged over a large population of galaxies, (ef) = (ei?) = 0, (e" 2 ) = 
(eJ> 2 ) and (efe^) = 0. Therefore, on Taylor expanding (B.IO) to first order 



in g, we see that e\ is roughly a very noisy estimate of gi since y (ef 2 ) ~ 
0.15, which is an order of magnitude larger than the typical value of g%. On 
applying the symmetries for a large population we find 

(B.13) (^)~g. 

The need to sample a population of galaxies also explains the use of complex 
notation for both e and g: the two components of e average cleanly to zero 
in the absence of cosmic lensing, unlike a notation involving magnitude and 
angle. See Figure 5 for a graphical representation of these parameters. 
More commonly considered is the combination of quadrupole moments 



(B.14) X 



Qxx Qyy "b 2iQxj/ 



Qxx ~\~ Qyy 

(sometimes known as "polarisation"), where we define components x = Xi + 
1X2 as before. This combination is more stable than e in the presence of noise. 
A purely elliptical shape has 

(B.15) Xl = ^Z^ cos {26), 

(B.16) X2 = 4^sin(20). 

a z + b z 

In general, x transforms under shear as 
(B.17) X 1 " * U + 2 * + ^ 



1 + M 2 + 25%^ 
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Fig. 5. The parameterisation of generalised ellipticity as two quantities ei and ei, show- 
ing the shape of isophotes of an elliptical galaxy. For example, a galaxy aligned with the y 
axis (9 = 90 degrees) has ei < and ei = 0. If this figure had shown individual ellipticity 
estimates like e or \, the orientations would be the same, but the elongations would vary. 



where denotes that the real part should be taken [Schneider and Seitz 
(1995)]. On Taylor expanding to first order in g and averaging over a pop- 



ulation for which = (X2) = 0) iXi 2 } = (X2 2 ) an d (X1X2) = we obtain 



Therefore if the variance of the unlensed ellipticities (xf 2 ) of the population 
is known then the shear can be approximately determined. For GREAT08 it 
may be possible to infer these ellipticity properties from the low noise sample 
since (xf 2 ) — (x^ 2 )- For more information, see Section 4 of Bartelmann and 
Schneider (2001) or Bernstein and Jarvis (2002). 

These are just two examples of generalised ellipticity estimates for a 
galaxy shape. All existing methods start in similar fashion, by construct- 
ing a mapping from the 2D image I(x,y) to a quantity with the rotational 
symmetries of an ellipse, such as e or x- For some methods, the mapping 
might involve a combination of quadrupole moments. To reduce contami- 
nation from neighbouring galaxies, and to limit the impact of noise in the 
wings of a galaxy, a weight function W(x,y) with finite support is normally 
included in equations (B.3), (B.4) and (B.5). Other methods might involve 
the fitting of a parametric (e.g., elliptical Gaussian, or exponential) model 
to the image, in which case the major and minor axes a and b are returned, 
along with the angle 0. Various basis functions have been tried for this mod- 
elling, including shapelets (Appendix D), sums of co-elliptical Gaussians (see 
Appendices E and G) de Vaucouleurs profiles (Appendix F). Each support 



(B.18) 
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a different range of potential galaxy shapes, and have had varying success 
on galaxies of different morphological type. 

B.2. Shear responsivity. Converting a general ellipticity measurement e 
into a shear estimate g also requires knowledge of how that ellipticity is 
affected by a shear. All existing shear measurement methods involve some 
form of ellipticity estimate and corresponding shear responsivity 

(B.19) 

(sometimes called the shear polarisability or susceptibility) so that 
(B.20) e \ = e K + p£ gj + (g 2 ), 

where j is summed over. In general, Pfj is a unique 2x2 tensor for each 
galaxy. The diagonal elements reflect how much a shear in one direction 
alters the ellipticity in the same direction, and the two diagonal elements 
tend to be similar. The off-diagonal elements reflect the degree to which a 
shear in one direction alters the ellipticity in the other, and tend to be small. 
For the present purposes, it is therefore reasonable to think of the shear 
responsivity for each ellipticity estimate as a scalar quantity P sh times the 
identity matrix. Expressions for P sh for three simple ellipticity measures are 
shown in Table 3. In general, shear responsivity depends on the ellipticity 
and cuspiness of an individual galaxy image and can even depend on the 
shear. For example, the axis ratio of a circle initially changes significantly 
under a small shear operation; but as the same shear is repeatedly reapplied, 
the object can tend toward a straight line but then its ellipticity can never 
increase further since |e| < 1. A shear estimate can then be formed via 

If we had access to the noise-free, unlensed galaxy image then we could 
calculate P sh for each galaxy. However the lensing signal does not change 



Table 3 

Some common ellipticity estimates and their corresponding shear responsivities, 
calculated to first order in g 

Ellipticity estimate Shear responsivity 



e=(^)(cos(20)+i S in(20)) 1 

2(1 -(; 

Eqn. (5-2) in Kaiser, Squires and Broadhurst (1995) 



(^)(cos(20) + l sin(20)) 2(1 - ( X f )) 



a 2 + b 

J" J(x,y)(x 2 — y^ -\-2ixy) W(x i y) dx dy 
J' I t y)(x 2 -^-y 2 )W (x ,y) dx dy 
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over time, and the strongest cosmic lensing signal is carried by the most 
distant — and therefore the faintest — galaxies. Measurements of P sh from 
the observed image are consequently very noisy. Since P sh is on the denom- 
inator of equation (B.21), errors in this quantity can contribute to potential 
biases and large wings in the global distribution of g. Getting it wrong in 
existing methods has typically led to a bias in shear measurement that is 
proportional to the shear ("multiplicative bias"). To reduce the noise and 
control bias, P sh is often averaged over or fitted from a large population 
of galaxies. It is typically fitted as a function of galaxy size and brightness 
(the distribution of true galaxy shapes is known to vary as a function of 
these observables) . However, the fitting function must be chosen carefully: 
shear responsivity often varies rapidly as a function of galaxy brightness, 
and existing methods have been found to be unstable with respect to the 
method used for this fitting. Sometimes P sh is also fitted as a function of 
ellipticity. This drastically overestimates the cosmological shear signal in in- 
trinsically elliptical galaxies, but this should average out over a population. 
The goal is merely to create a shear estimate that is unbiased for a large 
population. 

Shear responsivity thus represents the intrinsic morphology of an indi- 
vidual galaxy, or the morphology distribution for a population of galaxies. 
Although inferring the intrinsic shape distribution is not a goal in itself (see 
Figures 2 and 3), some aspect of it always needs to be measured. As dis- 
cussed in Appendix F, it arises in a Bayesian context as a prior on probability 
distribution for each shear estimate. 

B.3. Correcting for a convolution kernel. An image is inevitably blurred 
by a convolution kernel (generally known in astronomy literature as the Point 
Spread Function or PSF) introduced by the camera optics and atmospheric 
turbulence. The kernel is usually fairly compact, and two examples are given 
in Figure 6. The typical size is usually quantified by the Full Width Half 
Maximum (FWHM), which is the diameter where the light falls to half of 
the peak. Typically the FWHM is two or three pixels across, and of a similar 
size to the galaxies of interest. For a Gaussian kernel the Gaussian standard 
deviation is simply related to the FWHM via FWHM = 2^21n(2)o\ 

One approach to correct for the kernel, which is particularly useful for 
moment-based ellipticities, is to subtract the effects of the convolution ker- 
nel from both the ellipticity and the shear responsivity [e.g., equations (C.5) 
and (C.6)]. A second approach to correct for the convolution kernel, partic- 
ularly appropriate for fitting methods, has been a full deconvolution of the 
image. One fairly stable way to do this has been the forward convolution of 
a predefined set of basis functions with a model of the convolution kernel, 
followed by the fitting of these basis functions to the data. A deconvolved 
version of the image can then be reconstructed by using the derived model 
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Fig. 6. Detail of two realistic convolution kernels. The isophotal contours are logarith- 
mically spaced. 

coefficients with the (unconvolved) basis functions. This model can then 
be used to measure an ellipticity and shear responsivity. Getting this step 
wrong in existing methods can leave residual effects of (or overcorrects for) 
any anisotropy of the convolution kernel in the ellipticity estimate. This 
typically introduces a bias in shear measurement that is independent of the 
true shear ( "additive bias" ) . 

B.4. Correcting for pixelisation. Astronomical detectors for optical light 
count the total number of photons arriving in a region that we call a pixel. To 
a good approximation these pixels are on a square grid and do not overlap or 
have gaps between them. For methods that fit a model to each galaxy shape, 
including a forward convolution with the convolution kernel, pixelisation 
can in principle be incorporated easily. This is because integration within a 
square pixel is mathematically identical to convolution with a square top hat, 
followed by resampling at the centres of pixels. Since the observed images of 
stars have also been pixelised, they are already a rendering of the convolution 
kernel convolved with the square of the pixel. Deconvolving this naturally 
takes care of the pixelisation at the same time. In practice, models for the 
kernel are relatively smooth and may not capture the convolution with the 
square well. 
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No methods based on quadrupole moments, with correction for the con- 
volution kernel via subtraction of those moments, have yet included a proper 
treatment of pixelisation. Furthermore, for both types of method, an unex- 
plained difference has been observed [Massey et al. (2007d)]. This is particu- 
larly important to the cosmic lensing community because the design of some 
future telescopes currently feature only about 2 pixels across the FWHM of 
the convolution kernel. 

B.5. Averaging to remove noise. There are two contributions to the 
noise on a shear estimate g for a single galaxy. The first comes from the 
noise on the image, which is Poisson for GREAT08. The second comes from 
the fact that unlensed galaxies are not circular and thus it is not possible to 
tell for a single galaxy whether it is intrinsically elliptical or whether it is 
intrinsically circular and lensed by a strong shear. This can be beaten down 
by averaging the ellipticities of many galaxies. If the galaxies are in a similar 
location (or within the same set of GREAT08 images), the constant shear 
signal they contain will be all that remains. Unfortunately, almost all exist- 
ing shear measurement methods supply only a single (maximum likelihood) 
shear estimate for each galaxy, possibly with a single error bar (although 
see Appendices E and F). The PDF is not exactly a Gaussian, therefore a 
simple average is not the correct approach. 



APPENDIX C: EXISTING METHOD 1: WEIGHTED QUADRUPOLE 



Currently, the most widely used and oldest method for cosmic lensing 
analysis comes from the work of Kaiser, Squires and Broadhurst (1995), 
Luppino and Kaiser (1997) and Hoekstra et al. (1998), hereafter referred to 
as KSB+. The version of KSB+ made available for the GREAT08 challenge 
is the "CH" KSB pipeline documented in the STEP challenge [Heymans 
et al. (2006) and Heymans et al. (2005)]. The original KSB imcat software 
developed by Nick Kaiser is also available on request. 

KSB+ parameterises galaxies and stars according to their weighted quadru- 
pole moments 



where W is a Gaussian weight function of scale length r g , where r g is 
some measure of galaxy size such as the half-light radius and x\ = x — x, 
X2 = y — y- An ellipticity e is formed from these weighted moments using 
equation (B.14). The following KSB+ method details how to correct for the 
convolution kernel and get an unbiased estimate of the shear 7. 

The main limiting simplification in KSB+ is to assume that the convo- 
lution kernel can be described as a small but highly anisotropic distortion 



MOMENTS (KSB+) 





J I(x, y)xjXjW(x, y) dx dy 
J I(x,y)W(x,y)dxdy 
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convolved with a large circularly symmetric function. In most instances, this 
is not a good approximation to make, but the KSB+ method has proved to 
be remarkably accurate in practice. With this assumption, the "corrected 
ellipticity" of a galaxy (which it would have in perfect observations) e cor , is 
given by 

e a -e a - P a pPp, 

where p is a vector that measures the kernel anisotropy, and P sm is the smear 
responsivity tensor given in Hoekstra et al. (1998). The kernel anisotropy 
p can be estimated from images of stellar objects by noting that a star, 
denoted by an asterisk, has zero ellipticity (it is effectively a 5-function) 
before convolution: e* cor = 0. Hence, 

(C3) P, = (P sm %y a ° bs - 

The isotropic effect of the convolution kernel and the smoothing effect of the 
weight function W, can be accounted for by applying a tensor correction P 7 , 
such that 

(C4) eS* = <£ + J3j7* 

where e s is the intrinsic source ellipticity and 7 is the gravitational shear. 
Luppino and Kaiser (1997) show that 

(n pT — P sh P sm ( psm* \ - 1 psh* 

where P sh is the shear responsivity tensor given in Hoekstra et al. (1998) 
and p sm * and P sh * are the stellar smear and shear responsivity tensors, 
respectively. Combining the correction for the anisotropic part of the con- 
volution kernel [equation (C.4)] and the P J isotropic correction, the final 
KSB+ shear estimate 7 is given by 

(C6) la = (P^-}lef s -PI™P,]. 

This method has been used by many astronomers although different in- 
terpretations of the above formula have introduced some subtle differences 
between each astronomer's KSB+ implementation. 

Other methods inspired by KSB+ can be found in Hirata and Seljak 
(2003), Mandelbaum et al. (2005), Rhodes, Refregier and Groth (2000), 
Kaiser (2000) and Smith et al. (2001). 

APPENDIX D: EXISTING METHOD 2: SHAPELETS 

An orthonormal basis set, referred to as "shapelets," can be formed by the 
product of Gaussians with Hermite or Laguerre polynomials (in Cartesian 
or polar coordinates respectively). A weighted linear sum of these basis 
functions can model any compact image, including the irregular spiral arms 
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and bulges seen in galaxy shapes [Refregier (2003a), Massey and Refregier 
(2005)]. The shapelet transform acts qualitatively like a localised Fourier 
transform, and can be used to filter out high frequency features such as 
noise. 

The shapelet basis functions are not specifically optimised for the com- 
pression of galaxy shapes. However, they can be analytically integrated 
within pixels and have particularly elegant and convenient expressions for 
convolution and shear operations. After modelling both a galaxy shape and 
a convolution kernel as a linear combination of shapelet basis functions, con- 
volution can be expressed as a simple matrix multiplication [see also Berry, 
Hobson and Withington (2004)]. Deconvolution can be performed via a ma- 
trix inversion, although in practice appears more stable when performed 
via a forward convolution of the basis functions, then obtaining their co- 
efficients with a fast, least-squares fit. Shearing a shapelet model involves 
mixing between only a minimal number of model coefficients. 

Most of the parameters in a shapelet model are linear, which helps min- 
imise any potential biases that could arise when fitting faint, noisy images. 
Additional, nonlinear parameters are the overall scale size and the coordi- 
nates to the centre of the basis functions, plus the finite truncation order of 
the shapelet series. Each fitted nonlinear parameter requires a slower, nonlin- 
ear iteration to pre-defined goals. Some methods also use elliptical shapelet 
basis functions, derived by shearing circular shapelets: such methods require 
two additional nonlinear parameters (the two ellipticity components). 

Shapelet basis functions have been utilised in various ways, for both it- 
erative and noniterative shear measurement methods. There are three ap- 
proaches currently in the literature: 

• The shapelet modelling process is used to obtain a best-fit denoised, de- 
convolved and depixelised image from which quadrupole moments are cal- 
culated. Experiments with various functional forms for the radial shape 
of the weight function have been tried in Refregier and Bacon (2003) and 
Massey and Refregier (2005). Different weight functions provide a variety 
of benefits, primarily altering the shear responsivity factor (B.19). 

• A perfectly circular model galaxy with arbitrary radial profile is sheared 
and convolved until it best matches the observed image according to a 
least-squares criterion [Kuijken (2006)]. A subset of the shapelet basis is 
used as a way of allowing freedom in the radial profile. The probability 
distribution function of galaxy ellipticities is required, in order to calibrate 
how much of the shearing is required to account for intrinsic shapes. 

• A shapelet model for the galaxy is constructed which is "circular" by a 
particular definition. Unlike the previous bullet point, it need not be circu- 
larly symmetric, but is constrained to have zero ellipticity for a particular 
ellipticity definition. This is then sheared and convolved until it matches 
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the data. This is discussed by Bernstein and Jarvis (2002) and tested by 
Nakajima and Bernstein (2007). This similarly requires the probability 
distribution function of intrinsic galaxy ellipticities. 

Concerns have been raised that the Gaussian-based functions require a 
large number of coefficients to reproduce the extended, low-level wings of 
typical galaxies. If these wings are hidden beneath noise, and truncated in 
the model, the galaxy's ellipticity will be systematically underestimated. 
Initial experiments are attempting to replace the Gaussian part of shapelets 
with something better matched to galaxy shapes, like a sech or an expo- 
nential (Kuijken, in prep.). Appropriate polynomials can always be used 
to generate an orthonormal basis set, and this should extrapolate better 
into the wings. It might be possible to transfer experience with Gaussian 
shapelets to these new basis sets. The elegant image manipulation oper- 
ations would made significantly more complicated, and involve mixing be- 
tween many, nonneighbouring coefficients. However, the mixing matrices can 
still be pre-calculated for a given basis set as a look-up table. 

More information, links to the papers, and a software package for shapelet 
modelling in the IDL language can be obtained from http://www. astro. 
caltech.edu/~rjm/shapelets. Translations of the code into C++ and java 
may also be available upon request. 



Kuijken (1999) presented a maximum likelihood method in which each 
galaxy and convolution kernel is modelled as a sum of elliptical Gaussians. 
The implementation below follows Bridle et al. (2002) (im2shape) and Voigt 
and Bridle (2008). The model intensity -B(x) as a function of position x = 
(x,y) is 



where the inverse covariance matrix for each component Cj can be written 
in terms of the ellipse major and minor axes (a, and hi) as 



APPENDIX E: EXISTING METHOD 3: FITTING SUMS OF 

CO-ELLIPTICAL GAUSSIANS 





(E.2) 




(E.3) 




(E.4) 
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and the matrix is symmetric. Thus each Gaussian object component has 6 
parameters, which we consider to be the position of the centre Xj = (xi,yi), 
Kl = {o-i — bi)/(a,i + bi), 6i, ri = aibi and the amplitude Aj. Because the 
galaxy is a sum of Gaussians, convolution with the convolution kernel (an- 
other sum of Gaussians) is analytically simple. 

The likelihood of the parameters is calculated assuming that the noise on 
the image is Gaussian with unknown variance a and that an unknown con- 
stant background level b has been added to the image. The model parame- 
ter vector p thus consists of p = (a, b, x\,yi, |ei|, 6\, ah\, A%, . . . ,x n , y n , e n ,9 n , 
ab n ,A n ), where the subscripts denote the Gaussian component number and 
n is the number of Gaussian components that make up the object. To re- 
duce the number of parameters, the centre position, ellipticity and angle of 
all components in each galaxy are fixed to be the same. Thus each additional 
Gaussian contributes only two extra parameters. This is a significant limi- 
tation on the flexibility of the galaxy model, but makes the method more 
stable to noise in the image, and means that the shear estimate is equal to 
the ellipticity e of the Gaussian stack via equation (B.13). This scheme will 
not accurately model irregular galaxy shapes, but that is not the main goal. 

Each parameter in p is assigned a prior which allows the conversion to 
the posterior probability "P(p|D,PSF), assuming that the convolution kernel 
(PSF) is known exactly. Markov-chain Monte Carlo sampling is used to find 
the marginalised PDF in e\, £2 space. This must be combined with the PDF 
of unlensed galaxy ellipticities to find the PDF in gi, gi space. In practice 
the mean and standard deviation of the samples in e\ and e<i space are 
calculated. These are converted to shear estimates by adding the root mean 
square of the unlensed ellipticities ((e") 2 ) in quadrature with the standard 
deviation of the samples. 



APPENDIX F: EXISTING METHOD 4: LENSFIT— BAYESIAN SHEAR 



Lensfit is a model fitting shape measurement method that uses a Bayesian 
shear estimate to remove biases. A Bayesian estimate has the immediate 
advantage over likelihood based techniques in that, as described in Miller 
et al. (2007), due to the inclusion of a prior the shear estimate should be 
unbiased given an ideal shape measurement method and an accurate prior. 
Miller et al. (2007) also discuss how to remove any bias that occurs as a result 
of assuming that the prior is centred on zero ellipticity, which is assumed 
since the actual intrinsic distribution is unknown. 

For each galaxy a (Bayesian) posterior probability in ellipticity can be 
generated using 



ESTIMATE WITH REALISTIC GALAXY MODEL 
FITTING 
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where V(e) is the ellipticity prior probability distribution and C(y~i\e) is 
the likelihood of obtaining the ith set of data values y» given an intrinsic 
ellipticity (i.e., in the absence of lensing) e. By considering the summation 
over the data, the true distribution of intrinsic ellipticities can be obtained 
from the data itself 

(F.2) (I E«(e|y,)) = / * //(e) £ (y|e) de, 

where, on the right-hand side, the integration of the probability distribution 
gives the expectation value of the summed posterior probability distribution 
for the sample. e(y|e) is the probability distribution for y given e. This will 
yield the true intrinsic distribution under the conditions that e(y |e) = C(y |e) 
and "P(e) = /(e) (assuming the likelihood is normalised) from which we 
obtain 

(f.3) (^£ft(°ly)W(°) =/(«)■ 

i 

This is the equation that highlights the essence of the Bayesian shape mea- 
surement method, given a prior that is a good representation of the intrinsic 
distribution of ellipticities the estimated posterior probability should be un- 
biased. Kitching et al. (2008) discuss how to find the prior from a subset of 
the data itself. The shear is equal to the average expectation value of the el- 
lipticity with a factor d{e)i/dg which corrects for any incorrect assumptions 
about the prior 

where for an individual galaxy the (e) = J eP(e) de. The shear responsivity 
is calculated by finding the derivative of ellipticity with respect to the shear. 
Miller et al. (2007) show how this can be calculated directly from the prior 
and the likelihood in a Bayesian shear estimation method. 

To generate the full likelihood surface in (ei,e2), we fit a de Vaucouleurs 
profile to each galaxy image. This results in six free parameters per galaxy: 
position x, position y, e±, e2, brightness and a scale factor r. By doing the 
model fitting in Fourier space the brightness and position can be marginalised 
over analytically, leaving the ellipticity and radius to fit. The radius is then 
numerically marginalised over leaving a likelihood as a function of ellipticity. 
This likelihood is then used in the Bayesian formalism above to estimate the 
shear. 
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APPENDIX G: EXISTING METHOD 5: MODEL-FITTING METHOD 
WITH NONLINEAR DISTORTION TERMS 

This model-fitting method goes beyond those in which distortion is en- 
tirely parameterised by the linear effect of shear. In addition to ellipticity, 
nonlinear shapes are measured by using generalised versions of transforma- 
tion (2.1) that include second-order terms arising if the shear signal varies 
across the width of a galaxy (it does not in the GREAT08 simulations). The 
models simultaneously allow for the estimation of these nonlinear parame- 
ters, which should yield a more reliable estimation of shear, and are also of 
use in cosmology. 

This method uses a compact form for the generalised transformations 
through the use of complex variables {w = x + iy,w = x — iy}, where id is 
the complex conjugate of w. In this notation, equation (2.1) is simply written 
w n = w — gw l , where the superscripts "u" and "1" refer to the unlensed and 
lensed images respectively. That transformation can be generalised to 

(G.l) w u = w l - gw l - bw 12 - dw 12 - 2dw l w\ 

where additional nonlinear terms are introduced, with complex coefficients 
{b = b\ + i&2j d = d\ + id<{\ and d = d\ — icfo. See Irwin and Shmakova (2005) 
and Schneider and Er (2007) for details. 

This is a direct fitting method that uses an assumed model for a galaxy's 
radial profile F{r). The radial position r has a straightforward expression in 
the complex notation, with r 2 = x 2 + y 2 = ww. The intensity of the model 
as a function of position (x\,y\) for a lensed galaxy will have a form 

(G.2) F(w u - (w u ) ) -> F(w l - gw l - b(w 1 ) 2 - dw 12 - 2dw l w l - (w l ) ), 

where (wf)o is the centroid position. The function F(r) could be any radial 
profile function: for example a Gaussian, sum of Gaussians, a Gaussian times 
a Polynomial, de Vaucouleurs, exponential or a parametric spline function. 
This function represents a galaxy model before convolution with a kernel. 
It is convolved with the convolution kernel and then fitted to the galaxy 
image. 

Irwin and Shmakova (2005) and Irwin, Shmakova and Anderson (2007) 
used a Gaussian times a Polynomial profile as a model function 

(G.3) F(r u2 ) = (A + Br u2 + Cr u4 ) + e~ Dr ^ , 

where A is related to the intensity at the centre of the galaxy, B is for a 
better fit to an arbitrary behaviour at the origin, D is a cut-off scale that 
reflects the image size, and C can modify the behaviour as one approaches 
the size of the image. The "+" subscript indicates that if the polynomial 
has a value less than zero, it is to be set equal to zero. This is needed to 
avoid negative intensities, which would be unphysical. 
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The parameters of the radial profile {A,B,C,D}, the shape transforma- 
tion parameters {g,b,d} and the centroid position wq are determined by 
minimizing the norm 



where Ip is given by convolving F(r u2 ) with the PSF convolution kernel. In 
the model function Ip(r u2 ) : r u2 = w n w u , is understood to be a function of 
x 1 and y 1 through u) 1 and w . A weight to can be introduced to account for 
measurement uncertainty in each pixel if some are known to be more noisy 
than others. 

With the extra parameters b, d included in the shape distortion, as well 
as shear g, in addition to the radial shape parameters {A,B,C, D} and the 
centroid position, wo — ► (x\,yi), there are 12 variables to determine. The fit 
is done in several steps using a multi-dimensional Newton's method. At each 
step a subset of the 12 variables are allowed to vary. The curvature matrix for 
these parameters is computed then diagonalised, and eigenvectors with very 
small eigenvalues are not allowed to contribute to the function change at that 
step. The rate of convergence to a minimum is controlled by a parameter 
step size. 

This method has an advantage over other methods in that the models 
can represent a better fit to a galaxy image for galaxies with nonelliptical 
isophotes. In addition one of the challenging tasks of ellipticity measurements 
is in the definition of a galaxy's centroid. The centroid position is affected 
by the nonlinear terms and the simultaneous definition of these parameters 
will give a better centroid measurement. 
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